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The parton distribution functions (PDFs) of the proton, a necessary input to almost all theory predictions for 
hadron colliders, are reviewed in this document. An introduction to the PDF determination by global analyses 
of the main PDF fitting groups with an emphasis on HERA PDFs is presented. Finally, theory predictions 
based on different PDFs are compared to some recent relevant LHC and TEVATRON measurements. 



1. INTRODUCTION 

A precise knowledge of the Parton Distribution 
Functions (PDFs) of the proton is essential in order 
to make predictions for the Standard Model and be- 
yond the Standard Model processes at hadron collid- 
ers. The parton density function fi(x,Q 2 ) gives the 
probability of finding in the proton a parton of flavour 
i (quarks or gluon) carrying a fraction x of the proton 
momentum with Q being the energy scale of the hard 
interaction. Cross sections are calculated by convo- 
luting the parton level cross section with the PDFs. 
Since QCD does not predict the parton content of the 
proton, the shapes of the PDFs are determined by a 
fit to data from experimental observables in various 
processes, using the DGLAP evolution equation [lJ. 
The knowledge of proton PDFs mainly comes from 
the Deep Inelastic Scattering (DIS) HERA, fixed tar- 
get and TEVATRON data. The recent precise TEVA- 
TRON and LHC data has a potential to improve con- 
straints of the PDFs further. For example, through 
the W lepton asymmetry the different quark contri- 
butions can be accessed and with inclusive jet produc- 
tions it is possible to constrain the gluon PDF. 



2. PARTON DISTRIBUTION FUNCTIONS 
2.1. Proton Structure and DIS 

Deep inelastic lepton nucleon scattering probes the 
structure of matter at small distance scales. At HERA 
(Hadron Elektron Ring Anlage), DESY, electrons (or 
positrons) were collided with protons at centre-of- 
mass energies yfs = 225 — 318 GeV. Two different 
deep inelastic ep scattering processes are measured at 
HERA: neutral current (NC), ep —> eX, and charged 
current (CC), ep —> vX. In neutral current reactions 
the interaction proceeds via the exchange of a photon 
or a Z boson while in charged current scattering a 
boson is exchanged (figure [IJ . 

The NC (and similarly CC) cross section can be ex- 




Figure 1: Diagrams of neutral NC and charged CC 
current deep inelastic scattering processes. The symbols 
denote the particles, the label "X " denotes the hadronic 
final state. 

pressed in terms of structure functions: 

d2<7 NC _ 2na2 \ Ym F ± t Y xF ± - v 2 F±] 
dxdQ 2 ~ xQ* \- Y+t2 TY - X *3 y *iJ' 

where Y± = 1 ± (1 — y) 2 with y being the inelastic- 
ity. The structure function F% is the dominant con- 
tribution to the cross section, xF% is important at 
high Q 2 and Ft is sizable only at high y. In the 
framework of perturbative QCD the structure func- 
tions are directly related to the parton distribution 
functions, i.e. in leading order (LO) F2 is the mo- 
mentum sum of quark and anti-quark distributions, 
F 2 « xJ2 e li a + <?)) and xF$ is related to their dif- 
ference, xF% w x Y^, 2e q a q (q — q). At higher orders, 
terms related to the gluon density distribution (a s g) 
appear. 

In analogy to neutral currents, the inclusive CC ep 
cross section can be expressed in terms of structure 
functions and in LO the e + p and e~p cross sections 
are sensitive to different quark densities: 

e+ : ct^ + c = x[u + c] + (1 - y) 2 x[d + s] 
e~ : ctqc — x[u + c] + (1 — y) 2 x[d + s]. 

The measurements of the NC and CC cross sections 
from HERA extend the kinematic regime in Q 2 by 
more than two orders of magnitude with respect to 
the fixed target experiments. 
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2.2. Determination of PDFs 

The general procedure to determine PDFs is as fol- 
lows. Starting from a parameterisation of the non- 
perturbative PDFs at a low scale, either by making 
ad-hoc assumptions on their analytical form or by 
using the neural-net technology, fits to various sets 
of experimental data (mainly to DIS data) are per- 
formed within the DGLAP evolution scheme. The re- 
sulting PDFs depend on the choice of the input data, 
the order in which the perturbative QCD calculation 
is performed, the assumptions about the PDFs, the 
treatment of heavy quarks, the correlation between 
a s and the PDFs and the treatment of the uncertain- 
ties. Presently, the determination of PDFs is carried 
out by several groups, namely MSTW_0, CTEQ H, 
NNPDF d, HERAPDF [5], AB(K)M @ and GJR Q. 
In the following, a short description of each group with 
the relevant data sets used in PDF fits is given. 

The MSTW (Martin-Stirling-Thorne-Watt) PDFs 
are determined from a global analysis of hard- 
scattering data within the framework of leading- 
twist fixed-order collinear factorisation in the MS 
scheme 0. PDF sets in LO, NLO and NNLO are 
available. This determination is based on the HERA 
DIS data (with the exception of the latest combined 
HERA I data) as the main input for constraining 
quarks and gluon PDFs at low x, fixed target DIS data 
constraining quarks and gluons in the high-x region, 
fixed target Drell-Yan (DY) data which help to con- 
strain high- a: sea quarks, TEVATRON jet data con- 
tributing to high-x gluon PDF and W, Z data which 
provide an access to the different quark contributions. 

The CTEQ PDFs [| are obtained by a global analy- 
sis of hard scattering data in the framework of general- 
mass perturbative QCD. This analysis is based on the 
same data mentioned above (CT10 also includes the 
recent combined HERA I data and more TEVATRON 
data). PDF sets in NLO and NNLO QCD are avail- 
able. 

The Neural Net PDF (NNPDF) group 0] 
parametrise non-perturbative PDFs by training a neu- 
ral network (i.e. minimising the \ 2 ) on MC replicas of 
the experimental data. With such a method NNPDFs 
are free of assumptions made by other groups (how- 
ever there is some ambiguity from the procedure it- 
self). It also includes almost all above mentioned ex- 
perimental data in the fit and provides LO, NLO and 
NNLO PDFs. 

The HERAPDFs [| are based only on HERA DIS 
data which allows the usage of the conventional x 2 
tolerance of A\ 2 = 1. Since this analysis is solely 
based on ep data, the PDFs do not depend on the ap- 
proach for nuclear corrections needed for fixed target 
data. The HERAPDF has been extracted in NLO and 
NNLO. More details about HERA PDFs are given in 
the next section. 

The ABKM Q group provides NLO and NNLO 



PDFs based on the inclusive NC DIS world data from 
HERA and from fixed-target experiments. They also 
provide the strong coupling constant a s in NNLO. 

The GJR 7] group uses a more restrictive dy- 
namical approach in the PDFs determination, i.e. a 
dynamical model is used in the evolution of the input 
distributions starting at very low Q 2 . Fits to DIS, 
DY and TEVATRON jet data provide LO, NLO and 
NNLO PDF sets. 

The large number of PDF parameters and their 
treatment in the fitting procedure within the different 
groups results in differences of the PDFs provided. 
In order to study these differences, a benchmarking 
exercise is being carried out by the PDF4LHC 
working group formed by the members of the PDF 
fitting groups mentioned above. As an example of 
this exercise, the NLO prediction of the Higgs cross 
section (M H = 120 GeV) for the LHC is shown in 
figure [5] for different PDF sets as a function of a s . 
It can be seen that the different PDF groups are 
using different default values of a s and the cross 
section predictions differ by a few percent when 
using the default values of a s as well as when using 
the same value for a s . Furtehermore, the different 
predictions have different uncertainties (e.g. HERA 
PDF has asymmetric uncertainties due to considered 
parametrisation variation as explained in the next 
section). 

Further comparisons and an overview of the 
PDF4LHC working group activities can be found 
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Figure 2: NLO Higgs cross section predictions 
(M H = 120 GeV) using different PDFs at the LHC with 
7 TeV. 



2.3. HERA PDFs 

In the HERAPDF1.0 fit [|, the combined set on 
NC and CC ep inclusive cross-sections for the first 
running period of HERA (HERA I) is used. The 
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full statistics HERA inclusive CC and NC data are 
used for NLO and NNLO QCD fits resulting in 
HERAPDF1.5 Q. The same formalism, model and 
paramatrisation assumptions as in the HERAPDF1.0 
are used in the HERAPDF1.5(NLO) fit. 

The QCD predictions for the structure functions are 
obtained by solving the DGLAP evolution equations 
at NLO (or NNLO) in the JlS scheme with the 
renormalisation and factorisation scales chosen to be 
Q 2 . The DGLAP equations yield the PDFs at all 
values of Q 2 above the input scale Qq at which they 
are parametrised as a functions of x. The starting 
scale Qq is chosen to be 1.9 GeV 2 such that the 
starting scale is below the charm mass threshold. 
The QCD predictions for the structure functions are 
obtained by convolution of the PDFs with the NLO 
coefficient functions calculated using the general mass 
variable favour number RT scheme [10j. 
For the parametrisation of PDFs at the input scale 
the generic form xf(x) = Ax B (l — x) c (1 + Ex 2 ) is 
used. The parametrised PDFs are the gluon distribu- 
tion xg, the valence quark distributions xu v , xd v , and 
the u-type and d-type anti-quark distributions xU, 
xD. At the starting scale Qq = 1.9 GeV 2 xU = xu 
and xD = xd + xs. The central fit parametrisation is: 



HI and ZEUS 



xg{x) 


= A g x B °(l- 




xu v {x) 


= A Uv x B ^-(l 


- x) c ^{l + E Uv x 2 


xd v (x) 


= A dv x B ««(l 




xU{x) 


= A x B Hl- 




xD(x) 


= A D x B *(l- 


-x c °). 



The normalisation parameters A are constrained by 
the quark number sum-rules and momentum sum- 
rule, extra constrains for small- a; behaviour of 



type quarks B u 



Bd,. , Bf 



Bp, and 



Aq{1 — f s ) (f s is the strange quark distribu- 



and u- 
A = 

tion) which ensures that xu — > xd as x — > 0. 

The break-up of the HERA PDFs into different 
flavours is illustrated in figure El Model uncertainties 
(shown as yellow bands in the figure) of the central 
fit solution is evaluated by varying the input assump- 
tions: Q 2 min , f s , mass of heavy quarks mc and tub- 
Parametrisation uncertainties (green band) is formed 
by an envelope of the maximal deviation from the cen- 
tral fit varying parametrisation assumptions in the fit 
and therefore has an asymmetric shape. The deter- 
mination of parameterisation uncertainties are unique 
to HERAPDFs. 

An example of the parton distribution functions from 
HERAPDF1.5 at NNLO is shown in figure 0] HER- 
APDF1.5NLO and NNLO sets are the recommended 
HERA PDFs to be used for the predictions of pro- 
cesses at LHC. 



Q 2 = 10 GeV 

HERAPDF1.0 

exp. uncert. 
| model uncert. 

I parametrization uncert. xu v 




Figure 3: The parton distribution functions from 
HERAPDF1.0 at Q 2 = 10 GeV 2 . The gluon and sea 
distributions are scaled down by a factor of 20. The 
experimental, model and parametrisation uncertainties 
are shown separately. 
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Figure 4: The parton distribution functions from 
HERAPDF1.5 NNLO at Q 2 - 10000 GeV 2 , i.e. a region 
relevant for the hadron colliders TEVATRON and LHC. 
The gluon and sea distributions are scaled down by a 
factor 20. The experimental, model and parametrisation 
uncertainties are shown separately. For comparison, the 
central values of HERAPDF1.0 NNLO are also shown. 



2.4. Comparisons to recent LHC and 
TEVATRON results 

The prediction of the Z boson rapidity distribution, 
based on three different PDFs, are compared to the 
CDF measurement in figure^] The predictions of the 
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Figure 5: Z boson cross section times the branching ratio 
to electrons as a function of the rapidity y, as measured 
by the CDF collaboration. The cross sections are 
compared to the theory predictions based on three 
different PDFs. The total PDF uncertainties on the 
prediction on HERAPDF1.5 is shown as a blue band. 
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Figure 6: The electron charge asymmetry as a function of 
the lepon pseudorapidity n, as measured by the DO 
collaboration. The measurement is compared to the 
theory predictions based on three different PDFs. The 
total PDF uncertainties on the prediction on 
HERAPDF1.5 is shown as a blue band. 



W boson lepton asymmetry, based on the same PDFs 
are compared to the DO measurement in figure [SI As 
can be seen from these figures, the predictions with 
the HERAPDF sets are in good agreement with data 
and the agreement is comparable to that obtained for 
the CTEQ66 and MRSTW08 PDF sets. 

Recently many new physics results are released 
by the LHC experiments. The high precision LHC 
data can be used not only to compare with theory 
predictions to differentiate between different PDF 
sets but can also be used to put constraints on them 
instead. For example, the W lepton charge asymme- 
try A can help to constrain the u to d quark ratio 
(A w = (W + -W~)/(W + + W~) can be approximated 
to the ratio Aw ~ (u v — d v )/(u v + d v + 2u sea ) and thus 
is sensitive to the valence u and d quark ratio), the 
top quark measurement would have impact on gluons 
in the high- a; region, inclusive jet measurements can 
be used to improve constrains on the gluon (the gg 
initial state has dominant contribution to the cross 
section at the low jet transverse momenta, Pt)- 
The W boson lepton asymmetry as measured by 
ATLAS, CMS and LHCb is shown in figure The 
measured asymmetry is compared to predictions ob- 
tained with MSTW08, CTEQ66 and HERAPDF1.0 
PDFs. The PDF uncertainties are shown as bands 
for each prediction. There is already a more precise 
W asymmetry preliminary measurement available 
from the CMS collaboration [111. 
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Figure 7: The W lepton charge asymmetry measured by 
ATLAS, CMS and LHCb (symbols) compared to the 
predictions based on different PDFs shown as lines with 
uncertainty bands. 



The inclusive jet cross section is sensitive to the gluon 
density and can help to constrain it. In figure [5] the 
inclusive jet cross section measured by the ATLAS 
collaboration is shown as a function of the jet 
transverse momentum in different rapidity regions. 
The theoretical predictions based on different PDF 
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sets are describing the data equally well. 
A similar measurement was performed by the CMS 
collaboration [12)]. Figure H] shows the inclusive 
jet cross section measured by CMS as a function 
of jet transverse momentum in the rapidity bin 
0-0 < y < 0.5. The measurement is compared to 
predictions calculated using different PDFs which are 
agreeing similarly well with the data as in the case of 
ATLAS measurement. 




Figure 8: Inclusive double- differential jet cross sections 
as a function of jet Pt in different rapidity regions for 
jets identified using the anti-kt algorithm with R — 0.6 
(ATLAS). The plot shows the ratio of the data to the 
theoretical predictions calculated with different PDFs 
(CTEQ 6.6 is used as a reference). 
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2.5. Recent Updates of HERAPDFs 

The HERAPDF1.5 analysis has been extended to 
include HERA inclusive jet data. The new PDF set 
with jet data is called HERAPDF1.6 13]. The jet 
data are sensitive to gluons and a s and therefore 
reduce the correlation between the gluon density and 
a s in the PDF fits. The impact of the jet data can 
be seen if a s (Mz) is treated as a free parameter 
in the fit (see figure [TU] and [TTj) . In case of fitting 
simultaneously PDFs and a s to the inclusive DIS 
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Figure 10: The parton distribution functions from 
HERAPDF1.5 at Q 2 = 10 GeV 2 with a s (M z ) treated as 
a free parameter in the fit. 
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Figure 9: The ratio of inclusive jet cross section ( CMS) 
for \y\ < 0.5 as a function of jet Pt to NLO predictions 
based on the CT10 PDF set. Additional predictions are 
shown using HERAPDF1.0 and ABKM09 PDFs. PDF 
uncertainties are shown as coloured bands. 



Figure 11: The parton distribution functions from 
HERAPDF1.6 (with HERA jet data included in the fit) 
at Q 2 = 10 GeV 2 with a s (Mz) treated as a free 
parameter in the fit. 
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data the uncertainties on the gluon PDF becomes 
large at low x but as soon as the jet data are included 
(figure [TTjl , the correlation between the gluon PDF 
and a s (Mz) is reduced, resulting in significantly 
reduced uncertainties on the gluon PDF similar to 
the case of using inclusive DIS data only and fixing 
a s (Mz). In the fit with jet data the value of a s was 
extracted: 

a s {M z ) = 0.1202 ±0.0013(exp)±0.0007(mod/param) 
±0.0012(hadronisation) (scale) [3. 

More details about QCD analysis with jets at 
HERA were presented in the talk by T. Schoerner- 
Sadenius [lj]. 

The effect of including the HERA charm data 
in the QCD PDF fits was also studied at HERA. In 
this study, the charm data are used together with 
different implementations of the variable flavour 
number schemes (VFNS), which have different ap- 
proaches for the interpolation function and counting 
of orders in a s . Different VFN schemes can have 
different impacts on the charm contribution to the sea 
quark distribution and thus affect the composition of 
xU (x) from the xu(x) and the xc(x) contributions. 
Therefore the accuracy of the charm data influence 
the uncertainties on the W ± and Z cross section 
predictions at LHC. Figure [T2l summarises the study 
by showing the charm quark mass m™ od (which is an 
effective parameter in the fit) scanning results for all 
schemes used in the fits. It is interesting to observe 
that first, different schemes have different optimal 
charm quark mass parameter m™ od , and second, the 
X 2 minimum values are comparable for all schemes 
despite of different optimal values of m™ od . 
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Figure 12: Comparison of the X 2 of HERA I + Ff fits 
using different heavy flavour schemes represented as lines 
of different styles. 



The PDFs with optimal m™ od were then propagated 
to the calculation of W and Z boson cross section 
predictions for the LHC. As an example, the W + 
cross section as a function of m™ od for the different 
schemes is shown in figure 1131 Good agreement 
between these predictions is observed at optimal 
m rriod wn i cri results in a reduction of the uncertainties 
due to the heavy flavour treatment to below 1.0%. 
More details about the study can be found in (l5j . 
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Figure 13: W + production cross section a w + at the LHC 
for y/s — 7 TeV as a function of m™ od . The lines show 
predictions for different VFN schemes as indicated by the 
legend. The stars show the predictions obtained with the 
optimal value of m™° d used in a given scheme. The thick 
dashed horizontal lines indicate the range of a w + , 



determined for m" 



(opt). 



Finally, a QCD fit was performed which includes 
all currently available HERA NC and CC inclusive, 
charm, low energy and jet data (HERAPDF1.7). 
This fit prefers the same a s value as obtained with 
a fit using only inclusive and jet data. Similarly, 
the same optimal me value was found in the HER- 
APDF1.7 fit as that obtained in a fit when only 
inclusive and charm data are fitted. The break-up 
of the HERAPDF1.7 PDFs into different flavours is 
illustrated in figure ITU 



3. SUMMARY 

Parton distribution functions of the proton are es- 
sential for any cross section prediction at hadron col- 
liders. PDFs are generally determined by global QCD 
analyses fitting various sets of experimental data with 
the main knowledge about the proton structure com- 
ing from DIS experiments. Six modern PDF sets are 
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HERA I+II inclusive, jets, charm PDF Fit 
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Figure 14: The parton distribution functions from 
HERAPDF1.7 at Q 2 = 10 Gel/ 2 . The gluon and sea 
distributions are scaled down by a factor of 20. The 
experimental, model and parametrisation uncertainties 
are shown separately. For comparison, the central values 
of HERAPDF1.6 are also shown. 
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Finally, comparisons of theory predictions with var- 
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